76 research outputs found

    Comparison of methods for partitioning training and validation populations to optimize prediction accuracy and enable across-breed genomic selection in beef cattle

    Get PDF
    Title from PDF of title page (University of Missouri--Columbia, viewed on July 29, 2013).The entire thesis text is included in the research.pdf file; the official abstract appears in the short.pdf file; a non-technical public abstract appears in the public.pdf file.Dissertation advisor: Dr. Jeremy F. TaylorIncludes bibliographical references.Vita.Ph. D. University of Missouri--Columbia 2012.Dissertations, Academic -- University of Missouri--Columbia -- Genetics area program."July 2012"Across-breed genomic selection practices have the potential to revolutionize national genetic evaluation systems in the United States by including commercial cattle and increasing prediction power for hybrid animals. We used a population of 3,240 animals from the Carcass Merit Project to build across-breed genomic selection models for six carcass and four sensory panel traits across five breeds of commercially relevant beef cattle (Angus, Charolais, Hereford, Limousin, and Simmental). Allocation of these animals to training or validation populations based on genetic distance measures or genomic relationships coefficients proved to be no more effective than random allocation of animals. Realized accuracies in these populations showed that the prediction models were very effective when used on animals within the same project and short time span (0.41-0.78). When used in an external validation in animals separated by approximately 10 years, prediction accuracies showed severe reductions (from ~0.6 down to 0.05), indicating that retraining of prediction models will have to be done frequently (possibly annually) in commercial populations. We also identified numerous regions of the genome which showed evidence of harboring causal mutations for these traits of economic importance. These regions will serve as independent validations in the literature as well as a guidepost for researchers looking for causal mutations within the bovine genome.Includes bibliographical reference

    Accuracy of direct genomic breeding values for nationally evaluated traits in US Limousin and Simmental beef cattle

    Get PDF
    BACKGROUND: In national evaluations, direct genomic breeding values can be considered as correlated traits to those for which phenotypes are available for traditional estimation of breeding values. For this purpose, estimates of the accuracy of direct genomic breeding values expressed as genetic correlations between traits and their respective direct genomic breeding values are required. METHODS: We derived direct genomic breeding values for 2239 registered Limousin and 2703 registered Simmental beef cattle genotyped with either the Illumina BovineSNP50 BeadChip or the Illumina BovineHD BeadChip. For the 264 Simmental animals that were genotyped with the BovineHD BeadChip, genotypes for markers present on the BovineSNP50 BeadChip were extracted. Deregressed estimated breeding values were used as observations in weighted analyses that estimated marker effects to derive direct genomic breeding values for each breed. For each breed, genotyped individuals were clustered into five groups using K-means clustering, with the aim of increasing within-group and decreasing between-group pedigree relationships. Cross-validation was performed five times for each breed, using four groups for training and the fifth group for validation. For each trait, we then applied a weighted bivariate analysis of the direct genomic breeding values of genotyped animals from all five validation sets and their corresponding deregressed estimated breeding values to estimate variance and covariance components. RESULTS: After minimizing relationships between training and validation groups, estimated genetic correlations between each trait and its direct genomic breeding values ranged from 0.39 to 0.76 in Limousin and from 0.29 to 0.65 in Simmental. The efficiency of selection based on direct genomic breeding values relative to selection based on parent average information ranged from 0.68 to 1.28 in genotyped Limousin and from 0.51 to 1.44 in genotyped Simmental animals. The efficiencies were higher for 323 non-genotyped young Simmental animals, born after January 2012, and ranged from 0.60 to 2.04. CONCLUSIONS: Direct genomic breeding values show promise for routine use by Limousin and Simmental breeders to improve the accuracy of predicted genetic merit of their animals at a young age and increase response to selection. Benefits from selecting on direct genomic breeding values are greater for breeders who use natural mating sires in their herds than for those who use artificial insemination sires. Producers with unregistered commercial Limousin and Simmental cattle could also benefit from being able to identify genetically superior animals in their herds, an opportunity that has in the past been limited to seed stock animals

    Detection of selective sweeps in cattle using genome-wide SNP data

    Get PDF
    Background: The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps.Results: Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour.Conclusions: The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra-high-density assay scored in a small number of animals from a subset of the breeds. No sweep regions were shared between indicine and taurine breeds reflecting their divergent selection histories and the very different environmental habitats to which these sub-species have adapted.Peer reviewedAnimal Scienc

    Environmental effects on water intake and water intake prediction in growing beef cattle

    Get PDF
    Water is an essential nutrient, but there are few recent studies that evaluate how much water individual beef cattle consume and how environmental factors affect an individual’s water intake (WI). Most studies have focused on WI of whole pens rather than WI of individual animals. Thus, the objective of this study was to evaluate the impact of environmental parameters on individual-animal WI across different seasons and develop prediction equations to estimate WI, including within different environments and management protocols. Individual daily feed intake and WI records were collected on 579 crossbred steers for a 70-d period following a 21-d acclimation period for feed and water bunk training. Steers were fed in 5 separate groups over a 3-yr period from May 2014 to March 2017. Individual weights were collected every 14 d and weather data were retrieved from the Oklahoma Mesonet’s Stillwater station. Differences in WI as a percent of body weight (WI%) were analyzed accounting for average temperature (TAVG), relative humidity (HAVG), solar radiation (SRAD), and wind speed (WSPD). Seasonal (summer vs. winter) and management differences (ad libitum vs. slick bunk) were examined. Regression analysis was utilized to generate 5 WI prediction equations (overall, summer, winter, slick, and ad libitum). There were significant (P \u3c 0.05) differences in WI between all groups when no environmental parameters were included in the model. Although performance was more similar after accounting for all differences in weather variables, significant (P \u3c 0.05) seasonal and feed management differences were still observed for WI%, but were less than 0.75% of steer body weight. The best linear predictors of daily WI (DWI) were dry mater intake (DMI), metabolic body weights (MWTS), TAVG, SRAD, HAVG, and WSPD. Slight differences in the coefficient of determinations for the various models were observed for the summer (0.34), winter (0.39), ad libitum (0.385), slick bunk (0.41), and overall models (0.40). Based on the moderate R2 values for the WI prediction equations, individual DWI can be predicted with reasonable accuracy based on the environmental conditions that are present, MWTS, and DMI consumed, but substantial variation exists in individual animal WI that is not accounted for by these models

    Enhancing Soil Carbon in East Africa: The biophysical evidence, socio-economic incentives, and policy implications

    Get PDF
    This report provides a summary/sythesis of key research outputs and messages gathered from the four year BMZ-funded project on "Scaling up soil carbon enhancement interventions for food security and climate across complex landscapes in Kenya and Ethiopia.

    Accuracies of genomic breeding values in American Angus beef cattle using K-means clustering for cross-validation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genomic selection is a recently developed technology that is beginning to revolutionize animal breeding. The objective of this study was to estimate marker effects to derive prediction equations for direct genomic values for 16 routinely recorded traits of American Angus beef cattle and quantify corresponding accuracies of prediction.</p> <p>Methods</p> <p>Deregressed estimated breeding values were used as observations in a weighted analysis to derive direct genomic values for 3570 sires genotyped using the Illumina BovineSNP50 BeadChip. These bulls were clustered into five groups using K-means clustering on pedigree estimates of additive genetic relationships between animals, with the aim of increasing within-group and decreasing between-group relationships. All five combinations of four groups were used for model training, with cross-validation performed in the group not used in training. Bivariate animal models were used for each trait to estimate the genetic correlation between deregressed estimated breeding values and direct genomic values.</p> <p>Results</p> <p>Accuracies of direct genomic values ranged from 0.22 to 0.69 for the studied traits, with an average of 0.44. Predictions were more accurate when animals within the validation group were more closely related to animals in the training set. When training and validation sets were formed by random allocation, the accuracies of direct genomic values ranged from 0.38 to 0.85, with an average of 0.65, reflecting the greater relationship between animals in training and validation. The accuracies of direct genomic values obtained from training on older animals and validating in younger animals were intermediate to the accuracies obtained from K-means clustering and random clustering for most traits. The genetic correlation between deregressed estimated breeding values and direct genomic values ranged from 0.15 to 0.80 for the traits studied.</p> <p>Conclusions</p> <p>These results suggest that genomic estimates of genetic merit can be produced in beef cattle at a young age but the recurrent inclusion of genotyped sires in retraining analyses will be necessary to routinely produce for the industry the direct genomic values with the highest accuracy.</p

    Comparison of Bayesian models to estimate direct genomic values in multi-breed commercial beef cattle

    Get PDF
    Background: While several studies have examined the accuracy of direct genomic breeding values (DGV) within and across purebred cattle populations, the accuracy of DGV in crossbred or multi-breed cattle populations has been less well examined. Interest in the use of genomic tools for both selection and management has increased within the hybrid seedstock and commercial cattle sectors and research is needed to determine their efficacy. We predicted DGV for six traits using training populations of various sizes and alternative Bayesian models for a population of 3240 crossbred animals. Our objective was to compare alternate models with different assumptions regarding the distributions of single nucleotide polymorphism (SNP) effects to determine the optimal model for enhancing feasibility of multi-breed DGV prediction for the commercial beef industry.Results: Realized accuracies ranged from 0.40 to 0.78. Randomly assigning 60 to 70% of animals to training (n is about 2000 records) yielded DGV accuracies with the smallest coefficients of variation. Mixture models (BayesB95, BayesC[pi]) and models that allow SNP effects to be sampled from distributions with unequal variances (BayesA, BayesB95) were advantageous for traits that appear or are known to be influenced by large-effect genes. For other traits, models differed little in prediction accuracy (~0.3 to 0.6%), suggesting that they are mainly controlled by small-effect loci.Conclusions: The proportion (60 to 70%) of data allocated to training that optimized DGV accuracy and minimized the coefficient of variation of accuracy was similar to large dairy populations. Larger effects were estimated for some SNPs using BayesA and BayesB95 models because they allow unequal SNP variances. This substantially increased DGV accuracy for Warner-Bratzler Shear Force, for which large-effect quantitative trait loci (QTL) are known, while no loss in accuracy was observed for traits that appear to follow the infinitesimal model. Large decreases in accuracy (up to 0.07) occurred when SNPs that presumably tag large-effect QTL were over-regressed towards the mean in BayesC0 analyses. The DGV accuracies achieved here indicate that genomic selection has predictive utility in the commercial beef industry and that using models that reflect the genomic architecture of the trait can have predictive advantages in multi-breed populations.Peer reviewedAnimal Scienc

    Worldwide Patterns of Ancestry, Divergence, and Admixture in Domesticated Cattle

    Get PDF
    The domestication and development of cattle has considerably impacted human societies, but the histories of cattle breeds have been poorly understood especially for African, Asian, and American breeds. Using genotypes from 43,043 autosomal single nucleotide polymorphism markers scored in 1,543 animals, we evaluate the population structure of 134 domesticated bovid breeds. Regardless of the analytical method or sample subset, the three major groups of Asian indicine, Eurasian taurine, and African taurine were consistently observed. Patterns of geographic dispersal resulting from co-migration with humans and exportation are recognizable in phylogenetic networks. All analytical methods reveal patterns of hybridization which occurred after divergence. Using 19 breeds, we map the cline of indicine introgression into Africa. We infer that African taurine possess a large portion of wild African auroch ancestry, causing their divergence from Eurasian taurine. We detect exportation patterns in Asia and identify a cline of Eurasian taurine/indicine hybridization in Asia. We also identify the influence of species other than Bos taurus in the formation of Asian breeds. We detect the pronounced influence of Shorthorn cattle in the formation of European breeds. Iberian and Italian cattle possess introgression from African taurine. American Criollo cattle are shown to be of Iberian, and not African, decent. Indicine introgression into American cattle occurred in the Americas, and not Europe. We argue that cattle migration, movement and trading followed by admixture have been important forces in shaping modern bovine genomic variation.Comment: 38 pages, 15 figures. Various changes made to respond to peer reviews. Mostly, arguments were clarified and additional f-statistics were adde

    Novel analytical method, Birth Date Selection Mapping, detects response of the Angus (Bos taurus) genome to selection on complex traits

    Get PDF
    Background: Several methods have recently been developed to identify regions of the genome that have been exposed to strong selection. However, recent theoretical and empirical work suggests that polygenic models are required to identify the genomic regions that are more moderately responding to ongoing selection on complex traits. We examine the effects of multi-trait selection on the genome of a population of US registered Angus beef cattle born over a 50-year period representing approximately 10 generations of selection. We present results from the application of a quantitative genetic model, called Birth Date Selection Mapping, to identify signatures of recent ongoing selection.Results: We show that US Angus cattle have been systematically selected to alter their mean additive genetic merit for most of the 16 production traits routinely recorded by breeders. Using Birth Date Selection Mapping, we estimate the time-dependency of allele frequency for 44,817 SNP loci using genomic best linear unbiased prediction, generalized least squares, and BayesCpi analyses. Finally, we reconstruct the primary phenotypes that have historically been exposed to selection from a genome-wide analysis of the 16 production traits and gene ontology enrichment analysis.Conclusions: We demonstrate that Birth Date Selection Mapping utilizing mixed models corrects for time-dependent pedigree sampling effects that lead to spurious SNP associations and reveals genomic signatures of ongoing selection on complex traits. Because multiple traits have historically been selected in concert and most quantitative trait loci have small effects, selection has incrementally altered allele frequencies throughout the genome. Two quantitative trait loci of large effect were not the most strongly selected of the loci due to their antagonistic pleiotropic effects on strongly selected phenotypes. Birth Date Selection Mapping may readily be extended to temporally-stratified human or model organism populations.Peer reviewedAnimal Scienc

    A genome triplication associated with early diversification of the core eudicots

    Get PDF
    Background: Although it is agreed that a major polyploidy event, gamma, occurred within the eudicots, the phylogenetic placement of the event remains unclear. Results: To determine when this polyploidization occurred relative to speciation events in angiosperm history, we employed a phylogenomic approach to investigate the timing of gene set duplications located on syntenic gamma blocks. We populated 769 putative gene families with large sets of homologs obtained from public transcriptomes of basal angiosperms, magnoliids, asterids, and more than 91.8 gigabases of new next-generation transcriptome sequences of non-grass monocots and basal eudicots. The overwhelming majority (95%) of well-resolved gamma duplications was placed before the separation of rosids and asterids and after the split of monocots and eudicots, providing strong evidence that the gamma polyploidy event occurred early in eudicot evolution. Further, the majority of gene duplications was placed after the divergence of the Ranunculales and core eudicots, indicating that the gamma appears to be restricted to core eudicots. Molecular dating estimates indicate that the duplication events were intensely concentrated around 117 million years ago. Conclusions: The rapid radiation of core eudicot lineages that gave rise to nearly 75% of angiosperm species appears to have occurred coincidentally or shortly following the gamma triplication event. Reconciliation of gene trees with a species phylogeny can elucidate the timing of major events in genome evolution, even when genome sequences are only available for a subset of species represented in the gene trees. Comprehensive transcriptome datasets are valuable complements to genome sequences for high-resolution phylogenomic analysis
    corecore